Visual Person Localization with Dynamic Neural Fields: towards a Gesture Recognition System
نویسندگان
چکیده
For any visually-based interaction between persons and acting systems within a real-world environment the localization of a user by the system is a necessary condition. The presented work deals with this visual loca-lization problem of a user concretely referred to the autonomous mobile robot system MILVA of our department. Since this system is applied under real-world conditions especially for the localization some proper techniques are needed which have an adequate robustness. In our opinion, this requires the combination of several components of saliency towards a multi-cue approach, consisting of structure-and color-based features 2]. This paper introduces one of them: the localization based on a typical shape of contour. A simple contour shape prototype model consists of an arrangement of oriented lters doing a piecewise approximation of the upper shape (head, shoulder) of a frontally aligned person. Applying such lter arrangement in a multiresolution manner, this leads to a robust localization of frontally aligned persons even in depth. The central problem of selecting the most promising (salient) image region is treated by means of a three-dimensional dynamic neural eld performing a dynamic winner-take-all process (WTA, 1, 6]). After a successful localization of a person one can start a more detailed analysis of the gesture's meaning: besides the recognition of static gestures we also concentrate on the acquisition and later the recognition of dynamic gestures.
منابع مشابه
Person Localization and Posture Recognition for Human-Robot Interaction
The development of a hybrid system for (mainly) gesture-based human-robot interaction is presented, thereby describing the progress in comparison to the work shown at the last gesture workshop (see 2]). The system makes use of standard image processing techniques as well as of neural information processing. The performance of our architecture includes the detection of a person as a potential us...
متن کاملEMG-based wrist gesture recognition using a convolutional neural network
Background: Deep learning has revolutionized artificial intelligence and has transformed many fields. It allows processing high-dimensional data (such as signals or images) without the need for feature engineering. The aim of this research is to develop a deep learning-based system to decode motor intent from electromyogram (EMG) signals. Methods: A myoelectric system based on convolutional ne...
متن کاملMachine learning based Visual Evoked Potential (VEP) Signals Recognition
Introduction: Visual evoked potentials contain certain diagnostic information which have proved to be of importance in the visual systems functional integrity. Due to substantial decrease of amplitude in extra macular stimulation in commonly used pattern VEPs, differentiating normal and abnormal signals can prove to be quite an obstacle. Due to developments of use of machine l...
متن کاملFeature Learning for Conditional Random Fields and its Application to Gesture Recognition
Conditional random fields (CRFs) have been successful in many sequence labeling tasks, which conventionally rely on a hand-craft feature representation of input data. However, a powerful data representation could be another determining factor of the performance, which has not attracted enough attention yet. We describe a novel sequence labeling framework for gesture recognition, which builds a ...
متن کاملHand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study
Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...
متن کامل